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1. Introduction 

We provide a development that unifies, simplifies and extends considerably a number of 
minimax results in the restricted parameter space estimation literature. As illustrated 
with a series of examples, the unified minimax result has wide applicability with respect 
to the nature of the constraint, the underlying probability model and the loss function 
utilized. 

To further put into context the findings of this paper, consider a basic situation where 
X ~ N{9, 1), with > a (a > — oo known), and where 9 is estimated under squared error 
loss (d — 8) 2 . Katz [10] established that the Bayes estimator Sjj with respect to the flat 
prior on (a, oo) dominates the minimum risk equivariant (MRE) estimator 5q(X) = X. 
However, 5q remains a useful benchmark estimator with its constant risk matching the 
minimax risk, and with any improvement, such as 5jj, being necessarily minimax as 
well. In a technical sense and roughly speaking, the form (and unboundedness) of the 
restricted parameter space [a, oo) preserves a common structure with the unrestricted 
parameter space 3?, and the constructions of the least favourable sequence of priors for 



This is an electronic reprint of the original article published by the ISI/BS in Bernoulli, 
2012, Vol. 18, No. 2, 635-643. This reprint differs from the original in pagination and 
typographic detail. 



1350-7265 © 2012 ISI/BS 



2 



E. Marchand and W.E. Strawderman 



both problems are isomorphic, leading to the same minimax values. In contrast, the 
restriction to a compact interval 9 £ [a, b] is quite different and lowers the minimax risk 
(see example (C) in Section 3). 

Now, the above phenomenon is not only more general (e.g., general location families 
with absolutely continuous Lebesgue densities and strictly convex loss [5]), but similar 
results have been established in various other situations, beginning with Blumenthal and 
Cohen [2] in the context of ordered location parameters. Many other such contributions 
will be referred to below, but at this point we refer to the monograph of van Eeden [22] , as 
well as the review paper by Marchand and Strawderman [17], which contain a substantial 
amount of material and references relating to such problems. 

In this paper, we provide a unified framework for the above-mentioned problems, as 
well as many others either for more general loss and/or model, or for new situations such 
as estimating quantiles or covariance matrices under parametric restrictions. While re- 
sults for certain of the problems (e.g., (A) and (B)) are not new (although we generalize 
some to more general loss functions) and certain others have been studied for squared 
error loss, we greatly expand the set of loss functions for which minimaxity is established 
(e.g., (C)-(F)), certain of the problems (e.g., (G) and (H) and Remark 2) have not been 
extensively studied and thus our problems and results are mostly new. In Section 2, we 
formalize the general argument, relying on the existence of a least favourable sequence 
(Proposition 1), setting up conditions on the restricted parameter space that facilitate 
a correspondence with the above sequence (Theorem 1) and inferring (Corollary 1) that 
a minimax MRE estimator remains minimax with the introduction of a restriction on 
the parameter space under given conditions. Detailed examples follow in Section 3. These 
include the estimation of location or scale parameters under a lower (or upper) bound 
restriction, location parameter vectors restricted to a polyhedral cone, scale parame- 
ters subject to restricted ratios or products, linear combinations of restricted location 
parameters, location parameters bounded to an interval with unknown scale, quantiles 
for location-scale families with parametric restrictions and covariance matrices with re- 
stricted traces or determinants. 

2. Main result 

We begin with the following fact concerning minimax problems, presented as a synthe- 
sized version of parts of the Appendix of [3], pages 254-268. 

Proposition 1. Let R < oo be the minimax value in a problem with sample space X 
and parameter space Q, both Euclidean. Suppose the probability measures are absolutely 
continuous with respect to a a-finite measure, that the loss L{9, •) is lower semicontinuous 
on the action space and that L(9,a) — > b(9) = supL(0, ■) as \\a\\ — > oo for all 9. Then there 
exists a sequence of prior distributions with finite support and with Bayes risks equal to r n , 
such that r n approaches R as n — » oo, and there also exists a minimax procedure. 

We make use of a classical framework for invariant statistical problems. This in- 
cludes a group of transformations G with an invariant family of probability mea- 
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sures {Pg: 9 £ Q}, where X ^ Pg and X' = g(X) implies X' ~ P#/ with 0' = 50 and 
G = {.9 : .9 G C} forming a corresponding group of actions on fi. As well, for estimating 
a parametric function t(9) with loss i, additional assumptions include the condition 
that r(g9) depends on 9 only through r(£>), and that the group action on the decision 
space D satisfies the condition L(g9,g*d) = L(9,d) for all 9,d (e.g., [16], Section 3.2). 
With the help of Proposition 1, we obtain the following result. 

Theorem 1. Let a problem satisfying the conditions of Proposition 1 be invariant under 
a group G and let 6q(X) be minimax for a full parameter space ft. Suppose now that 
the parameter space is restricted to a subset Q* ; that there exist sequences g n 6 G and 
B n C fl* , such that g n B n C g n +iB n+ i; and that {J n g n B n = Q. Then Sq(X) remains 
minimax in the restricted parameter space problem. 

Proof. Let ir n , S n and r n be, respectively, Proposition l's sequence of priors, sequence 
of corresponding finite supports and Bayes risks, with r n — > R as n — >• oo. Choose m(n) 
sufficiently large so that m > m(n) implies g m B m D S n . As we show below, the prior 
distribution with finite support S* = <7~ („) (Sn) C fi*, given by n*(0) =T n (s~( n )9), has 
Bayes risk r* = r n . This implies directly that <5o(A) is minimax, since r* = r n — > R, as 
n — >• oo by Theorem 5.18 of [1]. It remains to show that the Bayes risks of 7r„ and 7r* 
coincide, and a standard argument is as follows. Let S(X) be any estimator. Then, for 
its risk, we have: 

R(6,6) = E e L(9, S(X)) = EggL(9,6(g~ 1 X)) = E- g gL{g9 1 g*5(g- 1 X)) = R{g9 1 g*5{g- 1 X)) 1 

by invariance. It follows that, if we set 9 = g m ( n )9, then 

r„ = E e [R(9,S n (X))}^E%R(^ n) 9,g* m{ 1 n) S n (g m{n) X))} 

= E S [R(6,g*- { l ) 5 n (g m(n) (X)))]=r* n , 

where S n (X) is the Bayes estimator corresponding to n n and hence g^^6 n (g m ^(X)) 
is the Bayes estimator corresponding to tt* . □ 

For applications, we will take B n of Theorem 1 to match fl* , but it is potentially 
more convenient to take B n as a sequence of open neighborhoods. Now, since the best 
equivariant estimators are often minimax, we deduce the following widely applicable 
result. 

Corollary 1. If an MRE estimator in a given problem satisfying the conditions of The- 
orem 1 is minimax, then it remains minimax in the restricted problem provided the re- 
stricted parameter space Cl* satisfies the conditions of Theorem 1. 

For the sake of clarity, we do not assume that the action space and the image of the 
restricted parameter space coincide. Hence, minimax estimators that can be derived from 
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Theorem 1 or Corollary 1 are not forced to take values in Q* . The main motivation resides 
in the benchmarking (i.e., dominating estimators that take values in £1* are necessarily 
minimax) and preservation of minimaxity (the minimax risks on f2 and f2* are equivalent). 
An important class of further applications of Corollary 1 will arise in cases where <5mre 
is minimax for the unrestricted problem Q and the parameter space fi* and loss L(9, ■) 
are convex, in which case the projection of <5mre onto ft* will dominate <5mre and hence 
be minimax. 

Remark 1. Notwithstanding the conditions required on the restricted parameter 
space f2*, the applicability of Corollary 1 hinges on the minimaxity of the best equiv- 
ariant estimator, in particular for unrestricted parameter space versions. As studied and 
established by several authors, it turns out that it is frequently the case that a minimax 
equivariant rule exists. We refer to [1], Section 6.7, [20], Section 9.5 and [16], note 9.3, 
pages 421-422, for general expositions and many useful references. In particular, the 
Hunt-Stein theorem gives, for invariant problems, conditions on the group (amenability) 
that guarantee the existence of a minimax equivariant estimator whenever a minimax 
procedure exists. [11] is a key reference. All of the examples below relate to amenable 
groups, such as the additive and multiplicative groups, the group of location-scale trans- 
formations and the group of lower triangular p x p non-singular matrices with positive 
diagonal elements. 

3. Examples 

We focus here on various applications, illustrating how the results of Section 2 apply to 
both existing and new results. We accompany this with further observations and remarks. 
As previously mentioned, such applications are quite varied with respect to model, loss 
and shape of the restricted parameter space. At the expense of some redundancy, some 
particular cases are singled out (e.g., (A) is a particular case of (D), while (B) is a par- 
ticular case of (E)) for their practical or historical importance. However, we do not focus 
here on specific determinations of the MRE estimators, but do refer to textbooks that 
treat in detail such topics (e.g., [16]). Throughout, we consider loss functions that satisfy 
the conditions of Proposition 1, and our findings relate to univariate and multivariate 
continuous probability models with absolutely continuous Lebesgue densities. 

(A) (A single location parameter.) Consider a location model with X ~ fo(xi — 
9,...,x n — 9), with O = 5R, known /o and invariant loss p(d — 9). Consider 
further a lower (or upper) bounded parameter space fl* (i.e., f2* = [a, oo) or 
il* = (— oo,a]). On one hand, f2* satisfies the conditions of Theorem 1 with the 
choices B n — f2* , g n = —n for Q* = [a, oo) (and B n — SI*, g n = n for 51* = (— oo, a]). 
On the other hand, following [11] or [7] for squared error loss, the MRE or Pitman 
estimator is minimax (and also Bayes with respect to the flat prior for 9 on 3ft). 
Thus Corollary 1 applies and the MRE estimator is minimax as well for the re- 
stricted parameter space Jl*. The result is not new (see, e.g., [10], for a normal 
model and squared error loss; [5], for strictly convex p; [18], for strict bowl-shaped 
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losses) . Finally, we mention the implication that the minimaxity property is hence 
shared by any dominator of the MRE estimator, which includes quite generally 
the Bayes estimator of p associated with the flat prior on Q* (e.g., [5, 18]). 

(B) (A single scale parameter.) Analogously, consider scale families with densities 
"4r/i( £ N---! £a )j with natural parameter space = 3? + , known fi, invariant loss 
p(d/a) and restricted parameter spaces 0* = [a, oo) or ft* = (0, a] (with a > 
known). With the multiplicative group on these restricted parameter spaces 
satisfy the conditions of Theorem 1 with B n = f2* and the choices g n — — and 
g n = n for f2* = [a, oo) and Q* = (0, o], respectively. From [11], whenever a mini- 
max estimator exists for the unconstrained case a > 0, it is necessarily given by 
the MRE estimator or equivalently by the Bayes estimator with respect to the 
non-informative prior n(a) = — J(o l00 )(c). Thus Theorem 1 and Corollary 1 ap- 
ply and such MRE estimators remain minimax for constrained parameter spaces 
[a, oo) and (0,a]; and this quite generally with respect to model f\ and loss p. 

A version of the above minimaxity result for strict bowl-shaped losses was ob- 
tained by Marchand and Strawderman [19]. Kubokawa [12] provided the result for 
entropy loss (i.e., p(z) = z — logz — 1), while van Ecdcn [22] provided the result 
(actually more general, which relates to a vector of scale parameters as in (E) 
below) for scale invariant squared error loss (i.e., p(z) — z 2 ). Also, we refer to 
the three last references for earlier results obtained for specific models fi , namely 
gamma and Fisher models. Finally, we also point out that the above development 
applies to estimating powers a r of a by the transformation Xi — > x\ (e.g., [19], for 
more details). 

(C) (Location-scale families with the location parameter restricted to an interval (pos- 
sibly compact).) For location-scale families with observables Xi, . . . ,X n having 
joint density -\ f2{ Xl ~ >l , ■ • ■ , x "~ fl ), consider estimating p with a > (unknown) 
under cither: (i) the compact interval restriction p £ [a,b], or (ii) p £ [a, oo); 
/2 known, invariant loss p(^—^). For (i), Theorem 1 applies with B n = O*, 
^ = ( -_ 2 M !n ) ) and &n n* tr)G»x»+: pe [- 2£ ^ 2l , 2 ^ 2i ],o->0}. 
As well, Kiefer [11] tells us that the MRE estimator of p or, equivalently, Bayes 
with respect to the Haar right invariant prior tt(p, a) = ^l(o,oo) ( <T )i is minimax for 
the unrestricted problem with £7 = K x K + (subject to existence). The conclusion 
derived from Corollary 1 is that <5mre is also minimax for the restricted parame- 
ter space with p £ [a, b],cr > 0, while a similar development and conclusion applies 
for (ii) with B n = ft* and g n = (— n, 1), a result of which also follows from (G) be- 
low. The result for compact interval restriction (i) generalizes the result previously 
obtained for scaled squared error loss (i.e., p(z) = z 2 ) by Kubokawa [13]. 

Finally, we point out that a compact interval restriction on p with known a 
typically leads to a different conclusion, with a corresponding MRE estimator 
that is not minimax. A somewhat familiar justification for this (e.g., see [16], 
page 327 for a normal mean p and squared error p) is as follows. Consider p 
to be strictly bowled-shaped in the sense that p'(-) is positive on (0,oo) and 
negative on (— oo,0). Denote Vq and <5tmre as the constant risk of <5mre and 
the truncation of i5mre onto the parameter space [a, 6], respectively. Observe that 
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Vq = R([a,Smre) > -R(m,<5tmre) for all p s [a, b], and that the compactness of the 
parameter space coupled with the continuity of the risk i?(^, <5tmre) imply that 
sup^guw R(p, 5tmre) < Vo and that, consequently, Smre is not minimax. 
(D) (Location parameters restricted to a polyhedral cone.) Consider independently 
generated copies of X ~ /o(&i — — with /o known, and p 

, . . . , restricted to a Polyhedral cone 

n* c = {peW: Cp>0}, (1) 

where C(g x p) (q < p) is of full rank (and the is a q x 1 vector of O's). Such 
restricted parameter spaces include: 

(i) orthant restrictions where some or all of the p^s are bounded below by 0; 

(ii) order restrictions of the type p± < pi < • • • < Pr with r <p; 

(hi) tree order restrictions of the type p\ < pi for some or all of the pi's; 
(iv) umbrella order restrictions of the type p\ < pi < ■ • ■ < A*m > ■ * ■ ^ A*p 
(m known). 

With B n = CIq and g„ e 5ft p as the additive group elements such that Cg ra = 
— n(l, . . . , 1)', we obtain <? n f2£; = {^6 C^i > —nil, . . . , 1)'} and choices that 
satisfy Theorem 1. Furthermore, for invariant losses p(\\d — fi\\), the results of 
Kiefer [11] tell us that, subject to risk finiteness, the MRE or Bayes estimator 
for /.t with a flat prior on !R P is minimax for the unconstrained problem /i € 5R P . 
We infer by Corollary 1 that the same estimator is minimax for any polyhedral 
cone fip as in (1). 

Other than problems in (A), the above unifies and extends several previously 
established results, beginning with the Blumcnthal and Cohen [2] case of order 
constraints and squared error p, and including more recent findings by Tsukuma 
and Kubokawa [21] for multivariate normal models, the general constraint in (1) 
and squared error p (also see [15] and [22], for further results and references). 
A much-studied and important case is the normal model with X ~ N p (p,I p ), 
p E and loss ||d — p\\ 2 , for which the above results apply with Smre(X) = X. 
As an interesting corollary of a result by Hartigan [8] and of the above, it follows 
that the Bayes estimator Su of p with respect to a flat prior on fi c , which Hartigan 
showed dominates X, is minimax for f2^. To conclude, we point out that a par- 
ticular case of Hartigan's result was obtained by Blumenthal and Cohen [2] for 
ordered location parameters in (ii) with r = p = 2. They actually provide a class 
of model densities / . including normal, uniform and gamma densities, through 
conditions that ensure that Su (also referred to as the Pitman estimator by the 
authors) is minimax under squared error loss. They also report on numerical evi- 
dence indicating that 5jj is not minimax in general with respect to /o. 

Remark 2. As an extension of the above, a similar development holds with the 
introduction of an unknown scale parameter a (a > 0). Indeed for (at least two) 
independent copies from density ^-/2( Xl ~ Ml , ■ • ■ , Xp ~ fJ,p ^ invariant loss p{ ^ d ~^ ) 
and restricted parameter space p € £l* c , a > 0, Theorem 1 and Corollary 1 apply 
as above, but with the MRE estimator of p now being generalized Bayes with 
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respect to the prior measure ir(p,cr) = il( 0iOO )(o")lsRp (/•*)• Moreover, if estimating 
an unconstrained a (or a r ) is the objective, the MRE estimator of a can be 
shown to be minimax as well with the parametric restrictions (subject to risk 
finitcness). This means that any minimax estimator of a r for an unconstrained 
problem remains minimax even when p £ Qq. 

(E) (Ratios or products of scale parameters.) For independently generated copies 
of X ~ (n,^)" 1 /^^,---,^) with /i a known Lebesgue density (on (5R+) P ), 
H = (5R + ) P , consider the restriction r = rL(°i) r< — c > 0; with the 7*j's known 
and estimating r under invariant loss p(-). The parametric function r includes 
interesting cases of ratios 2t anc l products (JiUj (with or without nuisance pa- 
rameters (Tfc, k ^ and the constraint on r represents a natural scale pa- 
rameter analog of (1) with q = 1. With £?„ = f2* , and <? n £ (5R + ) P the multi- 
plicative group element given by g n = (n~ 1 / Tl , . . . , n~ 1 ' Vp ), we obtain g„Q* = 
{(ax,...,a p ) £ (5R + ) P : nL( cr i)'" i > Thus, the conditions of Theorem 1 are sat- 
isfied, and Theorem 1 applies. Corollary 1 applies as well, by virtue of Kiefer [11], 
indicating that the MRE estimator (if it exists) , or equivalently Bayes with respect 
to the prior measure Yii ^l(o,oo) remains minimax for estimating r under the 
lower bound constraint above. We refer to [16], Chapter 3, problems 3.34-3.37 for 
examples. Finally, with the minimax result here being quite general with respect 
to the loss p (as well as with respect to the type of constraint and the model), 
we point out that the particular case of scale- invariant squared error loss (i.e., 
p(z) = (z — l) 2 ) is covered by van Eeden [22], Lemma 4.5. 

(F) (Linear combinations of restricted location parameters.) Consider location mod- 
els with X = (X\, . . . , Xk)' ~ Yii fi( x i ~ Pi), known /,'s, where we wish to estimate 
8 = Yli=i a i^i = a Vi under loss p(d — 6) and the restriction p £ CI* = {p £ 5ft": Pi>0 
for i = 1, . . . , k}. For the unconstrained version with p £ ft = 5i fe , the MRE esti- 
mator (also Bayes with respect to the flat prior on 5R fc ) is minimax [11], subject 
to existence and risk finiteness. Hence, Corollary 1 (or Theorem 1) applies with 
B n = Cl*,g n = (—71, • ■ • , —n) indicating that an MRE estimator remains minimax 
in the constrained problem p £ 0* for estimating 9. Kubokawa [14] has recently 
established the above for squared error loss, where the MRE estimator, whenever 
it exists, is the unbiased estimator Yli=i a i(Xi — h) with E(Xi — Pi) = h. The 
result is extended here with respect to p, and achieved with a different and more 
general proof. 

(G) (Quantilcs with parameter space restrictions.) Consider location-scale models with 
(Xi,.. .,X m )' /o(^); m > 2, / known, p £ % a > 0; with the objec- 
tive of estimating a quantile parameter p + ija; (of known order J2 00 fo{z)dz) 
under invariant loss p( d ~^~ ,,(:r )- Now, consider restricted parameter spaces such as: 

fi*={( A i,cr)£SRx5R + : p + rja > 0} 

and 

Q* 2 = {(p,a) £SRx3?+: p>a,a>b>0}. 
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Taking B n = VL\ and g n = (-n, i) such that g n ^\ = {{^^) E 5ft x 5ft + : p + ncr > 
—n} and g n ^2 = {(/■*> CT ) € 5ft x 5ft + : /i > — — n,a> — }, we see that the conditions 
of Theorem 1 arc satisfied. Moreover, subject to existence or risk finiteness, the 
results of Kiefcr [11] tell us the MRE estimator is minimax for the unrestricted 
parameter space O = 5ft x 5ft + . Hence, Corollary 1 applies and tells us that such 
MRE estimators are minimax for restricted parameter spaces VL\ and Q^- Pre- 
viously studied models, for which the above results apply, include exponential 
and normal /o's. For instance, consider a standard normal /o and squared er- 
ror p, where equi variant estimators are of the form X + ncS, 5mke(Xi, . . . , X rn ) = 
X + nc m S, with constant and minimax risk 1 + i] 2 (l — (m — l)c^J, and where 
X = i Eti *i, S 2 = EtiiX - Xf and c m = ^gg^ ([6], page 182). The 
general result above tells us the <5mre remains minimax for parameter spaces Sl^ 
and under squared error loss. 

Observe also that the above development relative to ill is still valid when- 
ever rj = 0, which relates to the problem of estimating a median or mean for 
symmetric /o's, with the corresponding minimaxity result previously obtained by 
Kubokawa [12] for scale-invariant squared error loss (i.e., p(z) = z 2 above). Similar 
results follow with an upper bound of for £l\ , as well as an upper bound for [i 
and/or an upper bound for a in the case of fij- Finally, we point out that the min- 
imaxity result and development above follow without emendation for the general 
case of non-independent components with joint density . . . , gm ~^ ). 

(H) (Restricted covariance matrices.) Consider a summary statistic S ~ Wishart(E, 
p, m) with m > p and S positive definite. Moreover, suppose that we wish to 
estimate S with invariant loss (under the general linear group) S) = ip(E~ 1 5), 
with ip(y) = tr(y) — log|y| — p and ip(y) = tr(y — I p ) 2 as interesting examples. 
A standard method to derive a minimax estimator here (e.g., [4], Section 6.2) 
is to consider the best equivariant estimator under the subgroup of lower 
triangular matrices with positive diagonal elements. Such equivariant estimators 
can be shown to have constant risk, be of the form Sa(S) = (S 1 / 2 )A(S 1 / 2 )' with A 
symmetric and S 1 / 2 the unique square root of S element belonging to and with 
the optimal choice (MRE) being minimax. For instance, under loss tr(E -1 <5 — I p ) 2 , 
the BEE is minimax and given by 8a , with Aq the diagonal matrix with elements 
(m+p-21 + l)- 1 ; i = l,...,p; [9]. 

Now, consider restrictions on S of the type ST = {£ > 0: |E| > c\ > 0} or 
n* = {E > 0: tr(E) > C2 > 0}. It is easy to see in both cases that the conditions of 
Theorem 1 apply with g n = -^I p and B n = O* . Hence, the above MRE estimators 
remain minimax under the above restrictions by virtue of Corollary 1. 



Concluding remarks 

We have provided in this paper a rich and vast collection of novel minimax findings 
for restricted parameter spaces. Furthermore, we have established a unified framework 
not only applicable to many new situations, but also covering many generalizations of 
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existing minimax results with respect to model and loss. For the sake of clarity and in 
a summary attempt to draw a sharper distinction between existing and new results to 
the best of our knowledge, we point out or reiterate that: 

• Results in (A) and (B) are not new except for the slight generalization on the loss 
with our results here applicable to losses that are not necessarily strictly bowl- 
shaped. 

• Situations (C)-(F) have been studied by others with existing minimax results for 
squared error p. Our results cover more general losses p in all these cases. 

• Remark 2, situations (G) and (H), correspond for the most part to new problems 
and the given results are novel. 
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